Image processing apparatus and control method therefor

ABSTRACT

In a data processing apparatus which forms a file having a plurality of pages on the basis of image data obtained by sequentially reading images on a plurality of originals, forms a file having the attribute information and end information representing the end of the file for each page, and transmits the file, when file creation processing is interrupted midway during file creation, the end information of the file is added at the interrupt time, thereby transmitting a normal file. In addition, data to be formed into one file is divided into a plurality of blocks in units of pages, and a plurality of files each having end information are sequentially transmitted. At this time, information for allowing the receiving side to convert the divided files into one file is transmitted. Furthermore, a plurality of data stored in a memory are appropriately selected, one file having a plurality of pages is formed on the basis of the selected data, and a file having the attribute information and file end information for each page is formed and transmitted.

FIELD OF THE INVENTION

[0001] The present invention relates to an image processing apparatuscapable of forming an image file having a plurality of pages from inputimage data and a control method therefor.

BACKGROUND OF THE INVENTION

[0002] In recent years, a technique of connecting an image scanner to acomputer network, scanning an original with the scanner to read theimage on the original, and sending the obtained image data to thecomputer network as an image file has been proposed.

[0003] In transmitting image data by such an image scanner, the userselects a transmission protocol and resolution, monochrome or colormode, format, and transmission destination of the image to betransmitted from the operation section.

[0004] Examples of the transmission protocol are Simple Mail TransferProtocol (to be referred to as SMTP hereinafter), File Transfer Protocol(to be referred to as FTP hereinafter), NetWare, and Server MessageBlock Protocol (to be referred to as SMB hereinafter).

[0005] The resolution of an image is defined by resolutions in the mainscanning and sub-scanning directions, and for example, dot per inch (tobe referred to as dpi hereinafter) is used as a unit. More specifically,when 200 dpi×100 dpi is designated, the resolution in the main scanningdirection is 200 dpi, and that in the sub-scanning direction is 100 dpi.

[0006] When a monochrome image is to be acquired by scanning, forexample, the single page tag image file format (to be referred to asS-TIFF hereinafter), multiple page tag image file format (to be referredto as M-TIFF hereinafter), or portable document format (to be referredto as PDF hereinafter) developed by Adobe is designated as an imageformat.

[0007] When a color image is to be acquired by scanning, for example,the Joint Photographic Experts Group Format (to be referred to as JPEGhereinafter) or PDF is designated. The S-TIFF and JPEG form a file fromthe image of one original. The M-TIFF and PDF form a file from theimages of a plurality of originals.

[0008] The expression of a transmission destination changes depending onthe transmission protocol. A transmission destination for SMTP means ane-mail address, and that for FTP, NetWare, or SMB means a server nameand a user name, password, and directory name on the server. The name ofa file to be transmitted can also be added.

[0009] In such an image scanner, when one file is to be formed from aplurality of pages, as in the M-TIFF or PDF, for example, the imagememory capacity may be short. In addition, when the user inputs aninterrupt instruction or the like, processing is incompletely ended foran image file which is being created for an original image read so far,so the image file cannot be used.

[0010] The reason for this is as follows. In the M-TIFF or PDF, to,e.g., designate and display an arbitrary page, an offset value is addedto the end of each page, and information for determining the file isadded to the end of the final page. If the processing is interrupted,such information cannot be written, and an incomplete file which cannotbe processed by an application compatible to M-TIFF or PDF is formed.

SUMMARY OF THE INVENTION

[0011] It is an object of the present invention to provide an imageprocessing apparatus for solving the above problem.

[0012] It is another object of the present invention to provide an imageprocessing apparatus which can make a file processible by applicationsoftware even when image formation processing based on input image datais interrupted.

[0013] It is still another object of the present invention to provide animage processing apparatus which can form an appropriately dividedmultiple page file from image data obtained by reading images on aplurality of originals.

[0014] It is still another object of the present invention to provide animage processing apparatus which can link a plurality of arbitrary imagedata into one file.

[0015] Other features and advantages of the present invention will beapparent from the following description taken in conjunction with theaccompanying drawings, in which like reference characters designate thesame or similar parts throughout the figures thereof.

BRIEF DESCRIPTION OF THE DRAWINGS

[0016]FIG. 1 is a view showing the arrangement of a system connected toa copying machine according to an embodiment of the present invention;

[0017]FIG. 2 is a block diagram showing the schematic arrangement of thecopying machine according to the embodiment of the present invention;

[0018]FIG. 3 is a block diagram showing a software configuration forcontrolling operation of the copying machine according to the embodimentof the present invention;

[0019]FIG. 4 is a view showing the format of an M-TIFF file;

[0020]FIG. 5 is a view showing the format of a PDF file;

[0021]FIGS. 6, 15, and 16 are flow charts showing processing proceduresaccording to the first embodiment;

[0022] FIGS. 7 to 14 and 24 to 29 are views for explaining the formatsof files formed in the second embodiment;

[0023] FIGS. 17 to 20 are views for explaining the format of a fileformed in the second embodiment;

[0024] FIGS. 21 to 23 are flow charts showing processing proceduresaccording to the second embodiment; and

[0025]FIGS. 30 and 31 are flow charts showing processing proceduresaccording to the third embodiment.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

[0026] (First Embodiment)

[0027] Embodiments of the present invention will be described below indetail with reference to the accompanying drawings.

[0028]FIG. 1 is a view showing the arrangement of a network system usinga copying machine according to an embodiment of the present invention.

[0029] A copying machine 1001 has not only a normal copying function ofreading a plurality of originals and printing the images on printingpaper but also a function of transmitting the read original images toanother device connected to a local area network (to be referred to as aLAN hereinafter) 1006 such as Ethernet developed by Xerox Corp. througha network interface.

[0030] A mail server 1003 for e-mail and a file server 1004 which storesvarious files are computers for storing data and the like sent from thecopying machine 1001. A client computer 1005 is a computer connected tothe mail server 1003 and file server 1004 to acquire or display data.The LAN 1006 is a network to which the copying machine 1001, mail server1003, file server 1004, client computer 1005, and the like areconnected. The mail server 1003 corresponds to a so-called SMTP serveror POP server, and the file server 1004 corresponds to a so-called FTPserver, NetWare server, or SMB server.

[0031] Although FIG. 1 illustrates only the single mail server 1003,file server 1004, and client computer 1005, a plurality of mail servers,file servers, and client computers may be connected to one network.Additionally, one apparatus can be commonly used as the mail server 1003and file server 1004.

[0032]FIG. 2 is a system block diagram of this embodiment, which showsthe arrangement of a controller unit arranged in the copying machine1001. A controller unit 2000 is connected to a scanner 2070 serving asan image input device and printer 2095 serving as an image output deviceand also connected to a LAN 2011 and public line (WAN) 2051 toinput/output image information or device information.

[0033] A CPU 2001 is a controller for controlling the whole system. ARAM 2002 is a system work memory for operation of the CPU 2001 and isalso used as an image memory for temporarily storing image data. A ROM2003 is a boot ROM which stores the boot program of the system. An HDD2004 is a hard disk drive for storing system software, image data, andthe like in a hard disk. An operation section I/F 2006 is an interfacesection to an operation section (UI) 2012 having a touch panel ordisplay section. The operation section I/F 2006 outputs display data tobe displayed on the operation section 2012 to the operation section 2012or transmits to the CPU 2001 information input from the operationsection 2012 by the system user. A network I/F 2010 is connected to theLAN 2011 to input/output information. A modem 2050 is connected to thepublic line 2051 to input/output information. The above devices arearranged on a system bus 2007. An image bus I/F 2005 is a bus bridgewhich connects the system bus 2007 to an image bus 2008 for transferringimage data at a high speed and converts the data structure. The imagebus 2008 is formed from, e.g., a PCI bus or IEEE1394. The above devicesare arranged on the image bus 2008. A raster image processor (RIP) 2060bitmaps a PDL (Page Description Language) to a bitmap image. A deviceI/F section 2020 connects the scanner 2070 and printer 2095 as imageinput/output devices to the controller unit 2000 to synchronously orasynchronously convert image data. A scanner image processing section2080 corrects, processes, and edits input image data from the scanner2070. A printer image processing section 2090 performs correction andresolution conversion corresponding to the printer for print outputimage data to be output to the printer 2095. An image rotation section2030 rotates image data. An image compression section 2040 executescompression/expansion processing by JPEG for multilevel image data andby JBIG scheme, MMR encoding, or MH encoding for binary image data. Thescanner 2070 optically scans and reads the image of an original conveyedfrom an ADF (Auto Document Feeder) one by one or placed on an originaltable under the open ADF and generates electronic image data. Theprinter 2095 prints an image based on input image data on printingpaper.

[0034]FIG. 3 is a block diagram showing a software configuration forimplementing the transmission function of this embodiment. Thesesoftware components are stored in the ROM 2003 and/or HDD 2004 in thecontroller unit 2000 and executed by the CPU 2001.

[0035] An operation section component 3001 is related to control of theoperation section 2012 and acquires a data transmission protocol or theformat and transmission destination of a transmission image by detectinga user instruction based on touch key operation on the operation section2012.

[0036] A transmission management component 3000 can instruct a scannermanagement component 3004 to read an original and an address managementcomponent 3002 to acquire address information in accordance with aprocessing instruction such as the data transmission protocol or theformat and transmission destination of the transmission image acquiredby the operation section component 3001, and transmit or store the readimage data or image data selected from a Box area provided in the HDD2004 in accordance with a designation by the operation section 2012 toor in the printer 2095, a facsimile apparatus connected to the publicline 2051, the file server/mail server connected to the LAN 1006 (2011),or a Box area provided in the HDD 2004 by issuing a transmissionprocessing instruction to a print component 3005, facsimile transmissioncomponent 3006, file server transmission component 3007, mailtransmission component 3008, or Box component 3009. Especially, the fileserver transmission component 3007 and mail transmission component 3008can transmit image data to the file server/mail server 1004 using fourprotocols: FTP, NetWare, SMB, and SMTP. The transmission managementcomponent 3000 is constructed by the address management component 3002and a job management component 3003 which controls jobs between itselfand the scanner management component 3004 and other transmissioncomponents.

[0037] <File Format>

[0038]FIG. 4 is a schematic view of an M-TIFF format in this embodiment.This data format is used by processing in the file server transmissioncomponent 3007 and mail transmission component 3008.

[0039] As for the data format structure of an M-TIFF file 4000, a headerand a corresponding data block form one unit, and a plurality of suchunits are arranged in an order of a header 4001 for the first image dataand a data block 4003 of the first image data, a header 4004 for thesecond image data and a data block 4006 of the second image data, and soon. For example, as the format of N image data, the first imagecompressed by, e.g., MMR is arranged in the data block 4003 of the firstimage data, and pieces of attribute information including the resolutionand compression scheme of the first image and an offset 4002 foridentifying the presence/absence of the next header data are arranged inthe header 4001 for the first image data. The second image compressed byMMR is arranged in the data block 4006 of the second image data, andpieces of attribute information including the resolution and compressionscheme of the second image and an offset 4005 for identifying thepresence/absence of the next header data are arranged in the header 4004for the second image data. This format continues to the last image, and“0” is stored in the header for the last Nth image as an offset to thenext header, thereby identifying that this is the last image, and onefile is completed here.

[0040]FIG. 5 is a schematic view of a PDF format in this embodiment.This data format is used by processing in the file server transmissioncomponent 3007 and mail transmission component 3008.

[0041] The data format structure of a PDF file 5000 has a header 5001,data 5002 of the first image, data 5003 of the second image, . . . data5009 of the Nth image, Pages 5010, Cross Reference 5011, and Trailer5012. The header 5001 is permanently generated. The Trailer 5012 has theposition information or the like of the Cross Reference 5011. The data5002 of the first image is formed from the first image data and piecesof attribute information including the resolution and compressionscheme. The data 5003 of the second image is also formed from the secondimage data and pieces of attribute information including the resolutionand compression scheme. The first data 5002, second data 5003, and thelike are assigned object numbers for uniquely specifying the data in thePDF. The object numbers are described in the Pages 5010. All objectnumbers used in the PDF file 5000 and offset values represented by theobject numbers in the PDF are described in the Cross Reference 5011.

[0042] The file formats shown in FIGS. 4 and 5 are applied to fileconversion in the flow charts of FIGS. 15 and 16.

[0043] <File Transmission>

[0044] Operation of the copying machine 1001 in transmitting a multiplepage file as shown in FIGS. 4 or 5 will be described next.

[0045]FIG. 6 is a flow chart showing the operation procedure of filetransmission. This flow chart shows the flow of processing controlled bythe CPU 2001 on the basis of a program stored in the ROM 2003 and/or HDD2004.

[0046] In step S6001, a Send window for making various designations fortransmission is displayed by touching a [Send] key A1 in the displayshown in FIG. 7 as the basic window (standby window) of the touch panelsection in the operation section 2012.

[0047] To set the read mode in step S6002, a Scan Setting window shownin FIG. 9 is displayed by pressing a [Scan Settings] key A2 in the Sendwindow. Whether the original is to be read as a monochrome image orcolor image is designated by a [Color Mode] key A4, and the readresolution of the image is designated by a [Resolution] key A3, therebysetting the read mode. The Send window is displayed again by pressing an[OK] key A5. The read mode need not always be set at this time and canbe set anytime before a read instruction.

[0048] To set the file type in step S6003, a [Send Settings] key A6 inthe Send window is pressed to display a Send Settings window in FIG. 10.To designate the file format of image data to be transmitted, a desiredfile format (image format) is selected from a list by a [File Type] keyA7. For a monochrome image, S-TIFF, M-TIFF, or PDF can be selected. Fora color image, JPEG or PDF can be selected. In this embodiment, since aformat which converts a plurality of originals into one file is used fortransmission, M-TIFF or PDF is selected for a monochrome image, and PDFis selected for a color image. When the file format is selected, a[Done] key A8 is pressed to display the Send window again. The file typeneed not always be set at this time and can be set anytime before a readinstruction.

[0049] An address for file transmission is set in step S6004. If thedesired address is already registered in Address Book or One-touchButtons 1 or 2, the address is selected therefrom. If the address is notregistered, a [New Recipient] key A9 is pressed to switch the window tothe display shown in FIG. 11 and then designate the desired address type(in this case, [File] is designated to store the file in a specific userfolder in the file server 1004 on the LAN 1006). Subsequently, inaccordance with the display shown in FIG. 12, a protocol to be used forfile transmission, the computer name of the file server 1004, the path(directory) to the file storage destination file, the user name, and apassword are input by a [Protocol] key A11, [Host Name] key A12, [FilePath] key A13, [User] key A14, and [Password] key A15, respectively.When all inputs for transmission destination settings are ended, and an[OK] key A16 is pressed, the window returns to the Send window.

[0050] Thus, various settings for transmission are ended. These settingcontents are put together by the operation section component 3001 andtransmitted to the job management component 3003. The job managementcomponent 3003 processes these pieces of information as one job. Theoperation section component 3001 also makes the operation section 2012display the contents such that the user of the copying machine 1001 canselectively cancel them.

[0051] The job management component 3003 notifies the scanner managementcomponent 3004 of the above setting contents and performs setting forreading. When a plurality of (N) originals are set on the ADF (AutoDocuments Feeder) of the copying machine 1001, or an original is placedon the original table under the open ADF in step S6005, and then thestart button in the operation section 2012 is pressed in step S6006,scan reading of the original is started in accordance with the abovesettings. In reading the originals set on the ADF, the originals areconveyed and sequentially read one by one. In reading the originalplaced on the original table, the original is exchanged by the user.Hence, for the last original, operation for instructing it is performedon the operation section 2012.

[0052] When a read instruction is input in step S6006, the jobmanagement component 3003 notifies the scanner management component 3004of the resolution and the monochrome or color read mode. The scannermanagement component 3004 causes the scanner 2070 to operate in thedesignated monochrome or color read mode at the designated resolution tosequentially read the plurality of originals and obtain input images,and causes the scanner image processing section 2080 to correct,process, and edit the input image data. In addition, the input imagesare rotated using the image rotation section 2030. In the monochromeread mode, the image data are compressed by two-dimensional encoding (tobe referred to as MMR hereinafter) complying with the ITU-Trecommendations, converted into a plurality of S-TIFF files in units ofpages, and stored in the HDD 2004. In the color read mode, the imagedata are compressed by the image compression section 2040 by JPEG,converted into a plurality of JPEG files in units of pages, and storedin the HDD 2004. When storage is ended, the scanner management component3004 notifies the job management component 3003 of the locations of theinput images in the HDD 2004.

[0053] Next, since the transmission protocol is FTP, NetWare, or SMB,the job management component 3003 selects the file server transmissioncomponent 3007 as the job processing destination and notifies the fileserver transmission component 3007 of pieces of information includingthe locations and format of the input images in the HDD 2004, thetransmission protocol, and the transmission destination. After that,when the user performs cancel operation in step S6008, the operationsection component 3001 notifies the job management component 3003 ofcancel of the job, the job management component 3003 notifies the fileserver transmission component 3007 of cancel of the job, and cancelprocessing is executed in step S6009 as will be described later.

[0054] To cancel transmission midway, a [System Monitor] key A18 ispressed on the Send window shown in FIG. 8 to display a System Monitorwindow shown in FIG. 13. In this case, since the job of the Sendfunction is to be canceled, a [Send] key A19 is pressed to display thejob status window of the Send function shown in FIG. 14. “Sending”represents that transmission processing currently progresses, and“Waiting” represents a transmission standby state.

[0055] Assume that the job for which transmission is instructed is a jobA20. The job A20 is pressed, and then, a [Cancel] key A21 is pressed,thereby canceling the transmission job A20. At this time, a dialog forconfirming whether the user really wants to cancel the job may bedisplayed to prevent any operation error.

[0056] The file server transmission component 3007 reads out thereceived protocol, determines the protocol to be used through thenetwork I/F 2010, and accesses the file server/mail server 1004 as thereceived transmission destination. Using the transmitted server name anduser name on the server, the file server transmission component 3007logs in to the designated file server/mail server 1004 and designatesthe directory where a transmitted image file is to be stored. The name(e.g., date/time+extension corresponding to data format=20000222.tif) ofa file to be transmitted is automatically generated and designated fromthe file creation date/time and the designated data format.

[0057] If the received format is M-TIFF, the file server transmissioncomponent 3007 transmits the S-TIFF images input to the HDD 2004 to thefile server/mail server 1004 on the LAN 1006 while converting them intoan M-TIFF file. When the transmission protocol is SMTP, these processingoperations are performed by the mail transmission component 3008.

[0058] <File Transmission and Conversion>

[0059] The flow of processing of converting image data obtained byreading images on originals into an M-TIFF file and of transmitting itwill be described next with reference to the flow chart shown in FIG.15. This flow chart shows the flow of processing controlled by the CPU2001 on the basis of a program stored in the ROM 2003 and/or HDD 2004.

[0060] When conversion into an M-TIFF file and transmission are startedin step S15001 in FIG. 15, a variable i representing the number oforiginals to be transmitted is initialized to 1 in step S15002. Thetotal number N of the originals and the variable i are compared in stepS15003. If i<N, the flow advances to step S15004.

[0061] The pieces of attribute information of the first original imageinput to the HDD 2004 are acquired in step S15004 to generate the firstheader 4001. The offset 4002 to the next header can be calculated byreading the image size in the attribute information of the first image.When the first header 4001 is generated, the first header 4001 istransmitted to the file server/mail server 1004. The first data block4003 means the first image portion input to the HDD 2004. The data block4003 is read out from the HDD 2004 and transmitted to the fileserver/mail server 1004. The variable i is incremented by one. Next, itis checked in step S15005 whether a cancel notification is received.More specifically, when the first data block is completely transmitted,it is checked whether a cancel notification for the job is received. IfNO in step S15005, the same operation as described above is repeated forthe second and subsequent images. When the Mth (M<N) data block iscompletely transmitted, and a cancel notification for that job has beenreceived at that time, the offset value to the (M+1)th header hasalready been stored in the Mth header as an offset to the next header.Hence, in step S15006, the (M+1)th header is generated by substituting“0” into the offset to the next header in the (M+1)th header and istransmitted, and the (M+1)th data block is transmitted, thus completingtransmission of the job.

[0062] If the received format is PDF, the file server transmissioncomponent 3007 must transmit the S-TIFF images input to the HDD 2004 tothe file server/mail server 1004 while converting them into a PDF file.FIG. 16 is a flow chart showing the flow of conversion into a PDF fileand transmission of it in this embodiment. This flow chart shows theflow of processing controlled by the CPU 2001 on the basis of a programstored in the ROM 2003 and/or HDD 2004. Conversion into a PDF file andtransmission are started in step S16001.

[0063] First, the header 5001 is generated and transmitted, and thevariable i representing the number of originals to be transmitted isinitialized to 1 in step S16002. The total number N of the originals andthe variable i are compared in step S16003. If i≦N, the flow advances tostep S16004. The first data 5002 is generated by reading out first imageand pieces of attribute information of that image input to the HDD 2004,and is transmitted, and the variable i for counting the number oforiginals is incremented by one in step S16004.

[0064] It is checked next in step S16005 whether a cancel notificationis received. If NO in step S16005, the same operation as described aboveis repeated for the second and subsequent images.

[0065] If the Mth (M<N) data is transmitted, and a cancel notificationfor that job has been received at that time, the Pages 5010 whichdescribes object numbers of the first to Mth images is generated andtransmitted, the Cross Reference 5011 which describes all the objectnumbers used in the PDF file 5000 and offset values represented by theobject numbers in the PDF is generated and transmitted, and the Trailer5012 is generated and transmitted in step S15006, thereby endingtransmission of the job.

[0066] In the above embodiment, a case wherein a job is canceled by auser instruction has been described. This also applies to a case whereina job is canceled due to an error or the like in the apparatus withoutintervening the user.

[0067] According to the above embodiment, when an interrupt of imagedata transmission is instructed during transmission, the interruptinstruction is received, and image data transmitted until the interruptare re-converted as one image data, thereby completing datatransmission. This allows smooth image processing convenient for theuser without creating any abnormal file.

[0068] (Second Embodiment)

[0069] In the above-described first embodiment, in transmitting aplurality of originals as one file having attribute information of eachpage, even when transmission is canceled midway, the file is completedsuch that image data that are already transmitted can be processed as anormal file on the receiving side. This prevents the memory area frombeing wasted for transmission of an abnormal incomplete file and alsoallows re-transmission from the interrupted portion.

[0070] In the second embodiment to be described below, for example, whenthe work area for the Send function set in a RAM 2002 of a copyingmachine 1001 has an upper limit, in the PDF transmission mode, an offsetvalue representing the storage place of image data to be transmitted andits attribute information must be described in a Cross Reference 5011together with the object number of the image data. For this reason, thenumber of images that can be converted into the PDF format andtransmitted is limited. To prevent this, in the second embodiment to bedescribed below, after the Cross Reference 5011 is generated andtransmitted within the allowable range of the work area, and a Trailer5012 is transmitted to complete one PDF file, such an expression isadded that a header and subsequent image data are additionally writtenin the PDF file, another PDF file including a Cross Reference andTrailer related to the added image data is added, and the two PDF filesare linked to express them as one PDF file, thereby preventing anylimitation on the number of images that can be converted into the PDFformat and transmitted.

[0071] This processing will be described below in detail.

[0072] The arrangements of the system and copying machine 1001 are thesame as in FIGS. 1 to 3, and a detailed description thereof will beomitted.

[0073]FIG. 17 is a view for explaining the schematic data format of aPDF file in this embodiment. In this embodiment, the upper limit numberof scanned images that can be converted into one file is M. FIG. 17shows a data format when the total number of images to be transmitted is2M.

[0074] A file server transmission component 3007 (or mail transmissioncomponent 3008) creates a PDF file as shown in FIG. 17 from scannedimage data. A PDF file 17000 has a header 17001, and then, data arecontinuously arranged in an order of a first data 17002, second data17003, . . . and Mth data 17009 in correspondence with the upper limitnumber of image data. A first Pages 17010, first Cross Reference 17011,and first Trailer 17012 continue next to the Mth data 17009. Then, dataare continuously arranged in an order of (M+1)th data 17022, (M+2)thdata 17023, . . . , and 2Mth data 17029. Like the portion next to theMth data 17009, a second Pages 17030, second Cross Reference 17031, andsecond Trailer 17032 continue next to the 2Mth data 17029.

[0075] Of these portions, the header 17001 can be permanently generated.The first data 17002 is formed from the first original image and piecesof attribute information including the resolution and compression schemeof the image. The second data 17003 is also formed from the second imageand pieces of attribute information including the resolution andcompression scheme of the image. The subsequent data have the samestructure as described above. The Mth data 17009 is also formed from theMth image and pieces of attribute information including the resolutionand compression scheme of the image. The first data 17002, second data17003, and the like are assigned object numbers which are unique in thePDF file 17000.

[0076] These object numbers are described in the first Pages 17010. Allobject numbers which have appeared before the first Cross Reference17011 and pieces of position information represented by the objectnumbers are described in the first Cross Reference 17011. The positioninformation of the first Cross Reference 17011 is described in the firstTrailer 17012.

[0077] The (M+1)th data 17022 is formed from the (M+1)th image andpieces of attribute information including the resolution and compressionscheme of the image. The (M+2)th data 17023 is also formed from the(M+2)th image and pieces of attribute information including theresolution and compression scheme of the image. The subsequent data havethe same structure as described above. The 2Mth data 17029 is alsoformed from the 2Mth image and pieces of attribute information includingthe resolution and compression scheme of the image. The (M+1)th data17022, (M+2)th data 17023, and the like are also assigned object numberswhich are unique in the PDF file 17000.

[0078] The object numbers of the first image data 17001 to the 2Mthimage data 17029 are described in the second Pages 17030. That the firstPages 17010 are updated by the second Pages 17030, the positioninformation of the second Pages 17030, the object number of the (M+1)thdata 17022, and the values of position information of all objects whichhave appeared from the (M+1)th data 17022 to the second Cross Reference17031 are described in the second Cross Reference 17031.

[0079] The position information of the first Cross Reference 17011 andthat of the second Cross Reference 17031 are described in the secondTrailer 17032. With this structure, the side that has received this filecan form one PDF file by adding pieces of information on which the firstPages, Cross Reference, and Trailer are reflected to the second Pages,Cross Reference, and Trailer and deleting the first Pages, CrossReference, and Trailer. A complete PDF file also contains otherelements, though they are irrelevant to this embodiment, and a detaileddescription and illustration thereof in FIG. 17 are omitted.

[0080]FIG. 18 is a view showing a detailed description example of thesecond Pages 17030. Referring to FIG. 18, the upper limit number M ofconvertible scanned images is 200, and the total number 2M of scannedimages is 400. A row 18001 of “/Kids” describes the object numbers ofthe first to 2Mth image data 17001 to 17029. The object number of thefirst image data 17001 is “10 R”, and the object number of the 2Mthimage data 17029 is “2394 0 R”. A row “/Count400” represents the numberof object numbers of all images. This value is 2M=400.

[0081]FIG. 19 is a view showing a detailed description example of thesecond Cross Reference 17031 shown in FIG. 17. As in FIG. 18, the upperlimit number M of convertible scanned images is 200, and the totalnumber 2M of scanned images is 400. A row 19001 describes “2 1”representing that the first Pages 17010 is updated by the second Pages17030. Next, the position of the second Pages is described in a row19002, the object number (1200) of the (M+1)th data 17022 is describedin a row 19003, and the position information of the (M+1)th data 17022is described in a row 19004. From the row 19004, the values of positioninformation of all object numbers which have appeared before the secondCross Reference 17031 are described.

[0082]FIG. 20 is a view showing a detailed description example of thesecond Trailer 17032 shown in FIG. 17. Referring to FIG. 20 as well, theupper limit number M of convertible scanned images is 200, and the totalnumber 2M of scanned images is 400. In the second Trailer 17032, theposition information of the immediately preceding (first) CrossReference 17011 is described in a row 20001, and the positioninformation of the second Cross Reference 17031 is described in a row20002.

[0083] The flow in executing transmission processing according to thesecond embodiment in the copying machine of this embodiment will besequentially described below.

[0084]FIG. 21 is a flow chart showing the flow of processing procedureof the copying machine 1001 according to the second embodiment. Thisflow chart shows the flow of processing controlled by a CPU 2001 on thebasis of a program stored in a ROM 2003 and/or HDD 2004.

[0085] Processes in step S21001 to S21006 are the same as theabove-described processes in steps S6001 to S6006 in FIG. 6, and adetailed description thereof will be omitted.

[0086] In file type setting in step S21003, however, the PDF file isdesignated. Additionally, in steps S21005 and S21006, regardless ofwhether the originals are set in the ADF or placed on the originaltable, the originals are processed as a series of originals (originalswhich should be contained in a common file) until the final original isdesignated (the last bundle of originals when the ADF is used).

[0087] In step S21007, reading of the image of each original andtransmission processing according to the procedure to be described laterare started on the basis of contents set in steps S21001 to S21004. Theprocessing is continued until the last original is determined in stepS21008.

[0088] When it is determined that transmission is ended, the series oftransmission processing operations are ended.

[0089] A series of processing operations related to creation andtransmission of a PDF file will be described next with reference to thedata structure shown in FIG. 17 and the flow chart shown in FIG. 22.Referring to FIG. 22, the upper limit number of pages of images that canbe converted at a time is M, and the total number of pages of a seriesof images to be transmitted is N (N>M).

[0090] When creation and transmission processing of a PDF file arestarted in step S22001, the file server transmission component 3007generates and transmits the header 17001 in step S22002 and initializesthe variable i representing the number of transmitted images to 0.

[0091] Next, a variable j representing the number of converted images isinitialized to 0 in step S22003.

[0092] The total number N of originals is compared with the variable iin step S22004. If the variable i is smaller than the number N, the flowadvances to step S22005. If the variable i is equal to or larger thanthe number N, i.e., when all images are transmitted, the flow advancesto step S22008.

[0093] The upper limit number M of images that can be converted at atime is compared with the variable j in step S22005. If the variable jis smaller than the number M, the flow advances to step S22006. If thevariable j is equal to or larger than the number M, i.e., when thenumber of images reaches the upper limit, the flow advances to stepS22007.

[0094] In step S22006, the attribute information and data of the ithoriginal image are acquired, data is generated and transmitted, and thevariables i and j are incremented by one. Then, the flow returns to stepS22004.

[0095] When transmission of the first to Mth images is ended, in stepS22007, the object numbers of the first-data 17002 to the ith data 17009are written in the first Pages 17010, pieces of position information ofthe object numbers are written in the first Cross Reference 17011, andthe first Trailer 17012 is generated and transmitted. Then, the flowreturns to step S22003.

[0096] When the processing in step S22007 is executed for the firsttime, the position information of the generated Cross Reference isstored. From processing for the nth (n is a natural number or 2 or more)time, the object numbers of the data of the first to (n×M)th images aredescribed in the nth Pages. The nth Cross Reference sequentiallydescribes that the (n−1)th Pages is updated by the nth Pages, theposition information of the nth Pages, the object number of the((n−1)×M+1)th data, and the values of position information of allobjects that have appeared from the ((n−1)×M+1)th to ith data. Ingenerating the nth Trailer, the position information of the precedingCross Reference is written, and the position information of the newlygenerated Cross Reference is stored.

[0097] When transmission of the first to Nth images is ended, in stepS22008, the object numbers of data of the first to Nth images arewritten in the Pages, that the Pages of the immediately preceding fileblock are updated by the newly generated Pages, the position informationof the newly generated Pages, the object number of the (N−j)th data, andpieces of position information of all objects that have appeared fromthe (N−j)th data to the newly generated Cross Reference are written inthe Cross Reference, and a Trailer in which the position information ofthe preceding Cross Reference and that of the newly generated CrossReference are written is generated and transmitted. Then, the flowadvances to step S22009.

[0098] In step S22009, conversion to the PDF file and transmissionprocessing are ended.

[0099] In the above embodiment, the upper limit number of pages is usedas a reference for division. However, the present invention is notlimited to this, and the data amount (M bytes) of image data may beused.

[0100] In the above-described embodiment, a case wherein a PDF file isformed has been described. However, the present invention can also beapplied as long as a file having a data format which puts a plurality ofimage data together into one file is to be created, and the same problemas described above is posed.

[0101] In this case as well, a plurality of file blocks are formed inaccordance with the amount of original images with which a file can beformed, and transmitted with additional information representing thecorrelation between the blocks, and on the receiving side, the blocksare processed as a series of files on the basis of the additionalinformation.

[0102] In the above-described embodiment, information capable ofspecifying an immediately preceding file block is added to second andsubsequent blocks. However, information capable of specifying asucceeding file block may be added.

[0103] According to the above-described second embodiment, sinceinformation representing that information of Pages of an immediatelypreceding file block is updated by the current Pages is written in thesecond (final) Cross Reference, the side that has received this file candelete the previous Cross Reference or Pages, so the file to be finallyformed can be compact and simple.

[0104] In addition, since each of divide files is segmented in units ofpages and has a Cross Reference and pages, the divided files (files eachhaving multiple pages) are completed as files by themselves. For thisreason, the files can be processed as independent files depending on theapplication on the receiving side without writing pieces of informationof the remaining divided files in the final Cross Reference and Pagesand deleting the Cross Reference and Pages of each file, unlike theabove embodiment. Hence, even in an environment where a large filecannot be processed, the respective files can be appropriatelyprocessed.

[0105] (Third Embodiment)

[0106] In the above embodiment, offset values, Cross Reference, Pages,and Trailer are controlled whereby a file whose processing isinterrupted midway is completed at that time or a large quantity ofimage data is divisionally transmitted.

[0107] In the third embodiment to be described below, a plurality ofarbitrary image data are selected, linked and reconverted into one file,and transmitted using the technique of controlling offset values and thelike.

[0108] The procedure will be described below in detail.

[0109]FIG. 23 is a flow chart showing the flow of operation procedure ofa copying machine 1001 according to the third embodiment. This flowchart shows the flow of processing controlled by a CPU 2001 on the basisof a program stored in a ROM 2003 and/or HDD 2004.

[0110] In steps S23001 and S23002, the read mode is set as in the abovedescribed embodiments.

[0111] To do Box designation in step S23003, a [Store In Box] key A22 ispressed on the Send Settings window shown in FIG. 10 to display theStore In Box window shown FIG. 24.

[0112] In the copying machine 1001, the HDD 2004 is divided into aplurality of areas such that various kinds of image data can be stored.Individual divided storage destinations are called Boxes discriminatedby Box numbers. Assume that a Box A23 with Box number 00 is selected.When an [OK] key A24 is pressed, the storage destination is set, and thewindow returns to the Send Settings window. Subsequently, a [Done] keyA8 is pressed to return the window to the Send window.

[0113] When it is confirmed that an original is set in the ADF or on theoriginal table in step S23004, and the start button on an operationsection 2012 is pressed in step S23005, reading of the image on theoriginal is started on the basis of the set contents. In step S23006, ifthe read image data is a monochrome image, it is stored in Box 00 as anS-TIFF file, and if the image data is a color image, it is stored asJPEG file. At this time, the read mode, i.e., color or monochrome, theimage size, the number of pages, and the date/time of storage are alsostored in correspondence with the image data. When storage is ended, ascanner management component 3004 notifies a job management component3003 of the location of the stored image in the HDD 2004. The jobmanagement component 3003 notifies a Box component 3009 of the locationof the stored image in the HDD 2004 and its Box number. The Boxcomponent 3009 stores, in the HDD 2004, the relationship between thelocation of the image stored in the HDD 2004 and the Box number for theimage. A number of information related to a plurality of various imagedata stored at various times are stored in the Box component 3009together with Box numbers. The image stored at this time is not limitedto an image from a scanner 2070. An image from various input sourcessuch as a network I/F 2010 or modem 2050 may also be stored.

[0114] The above processing can be executed anytime.

[0115] In step S23007, to select desired image data in the Box, a [MailBox] key A25 is pressed on the basic window shown in FIG. 7 to displaythe Mail Box window shown in FIG. 25. In this case, to select aplurality of image data stored in Box 00, a key A26 is pressed on theMail Box window to display the contents of Box 00. On the content checkwindow of Box number 00 in FIG. 26 (this window is displayed on thebasis of information obtained by inquiring an operation sectioncomponent 3001), the user selects three image data A27, A28, and A29 inthe order named as image data to be linked and transmitted withreference to the image sizes and storage dates/times. The operationsection component 3001 stores the selected image data together with theorder thereof, and the selected order is displayed at the left end (1 to3).

[0116] In step S23008, a [Send] key A30 is pressed for transmissionsetting to display the Sending Address Settings window shown in FIG. 27.In this case, to transmit the image data to an address registered in theAddress Book in advance, an [Address Book] key A31 is pressed to displaythe Address Book window shown in FIG. 28. To select transmission tomarc@canon.com by e-mail and to matt@canon.com by Internet FAX, the userpresses keys A32 and A33. In accordance with this selection, thetransmission protocol (SMTP) and pieces of information necessary fortransmission (user name and password necessary for an access to the SMTPserver) are set in addition to the e-mail addresses. When FTP isselected as the transmission protocol, the address information containsthe server name, the user name on the server, the password, and thedirectory name. After login processing, storage processing is executed.

[0117] After the keys A32 and A33 are selected, an [OK] key A34 ispressed to set these addresses. The window returns to the SendingAddress Settings window. As shown in FIG. 29, the set addresses areinput.

[0118] When the file type is set in step S23009 as described above, andthe start button of the operation section 2012 is pressed in stepS23010, the three selected image data are linked in their order andtransmitted as one file according to a processing procedure (to bedescribed later) in step S23011.

[0119] As the image format for transmission, S-TIFF, M-TIFF, or PDF canbe selected for a monochrome image, and JPEG or PDF can be selected fora color image or a combination of monochrome and color images. However,as the above-described data format capable of putting image data intoone file, M-TIFF or PDF need be selected for only a monochrome image,and PDF need be selected for only a color image or for a combination ofmonochrome and color images. In the following description, assume thatsuch a data format for converting images of a plurality of originalsinto one file is selected.

[0120] The transmission start timing should be switched in accordancewith the transmission protocol. If generation and transmission of a filecan be executed in parallel, they are done so. If not, transmission isstarted after completing a file.

[0121] When it is determined in step S23012 that transmission is ended,the series of transmission processing operations are ended.

[0122] In the flow charts shown in FIGS. 30 and 31, the file formatsshown in FIGS. 4 and 5 are referred to.

[0123] The flow charts in FIGS. 30 and 31 show the flows of processingcontrolled by the CPU 2001 on the basis of a program stored in the ROM2003 and/or HDD 2004.

[0124]FIG. 30 is a flow chart showing processing of linking a pluralityof image data and transmitting them as one file, which is executed bythe copying machine according to this embodiment. The processing shownin FIG. 30 is executed by a file server transmission component 3007 ormail transmission component 3008 in accordance with the image datatransmission destination. In the following description, the number oftransmission image data selected by the user is N (N is a natural numberof 2 or more).

[0125] When conversion into an M-TIFF file and transmission of the fileare started in step S30001, a variable i representing the number oftransmitted images is initialized to 1 in step S30002. The total numberN of images to be transmitted is compared with the variable i in stepS30003. If the variable i is smaller than the number N, the flowadvances to step S30004. If the variable i is equal to or larger thanthe number N, the flow advances to step S30006.

[0126] In step S30004, pieces of attribute information of the image ofthe ith original are acquired from a Box where the image is stored, anoffset value to a header is obtained from the data size, and a header isgenerated and transmitted. Simultaneously, the stored image data is readout and transmitted. When transmission is ended, the variable i isincremented by one, and the flow returns to step S30003.

[0127] When the variable i equals the number N, transmission processingis executed in step S30006, as in step S30004, and conversion andtransmission processing are ended. As described above, 0 is stored as anoffset value to the next header, which is contained in an Nth header4011.

[0128]FIG. 31 is a flow chart showing processing of conversion from anS-TIFF file or JPEG file to a PDF file and transmission processing,which are executed by the copying machine according to this embodiment.The processing shown in FIG. 31 is executed by the file servertransmission component 3007 or mail transmission component 3008 inaccordance with the image data transmission destination. In thefollowing description, the number of transmission image data selected bythe user is N (N is a natural number of 2 or more).

[0129] When conversion into a PDF file and transmission of the file arestarted in step S31001, a header is generated, and the variable irepresenting the number of transmitted images is initialized to 1 instep S31002.

[0130] The total number N of images to be transmitted is compared withthe variable i in step S31003. If the variable i is equal to or smallerthan the number N, the flow advances to step S31004. If the variable iis larger than the number N, the flow advances to step S31006. In stepS31004, pieces of attribute information and data of the ith image dataare acquired from a Box where the image data is stored, data isgenerated and transmitted, and the variable i is incremented by one.Then, the flow returns to step S31003.

[0131] When the variable i is bigger than the number N, the objectnumbers of the first to Nth images are written in Pages 5010, offsetvalues thereof are written in a Cross Reference 5011, and a Trailer 5012is generated and transmitted in step S31006.

[0132] In the above-described embodiments, only an arrangement in whichimage data temporarily stored as a data format such as S-TIFF or JPEG isconverted into the M-TIFF or PDF format in transmission has beendescribed. A plurality of originals may be converted into a file formatsuch as M-TIFF or PDF at the time of reading and storing the imageswithout any conversion in transmission.

[0133] The client computer 1005 may have a remote control function forthe operation section 2012 of the copying machine 1001 such that theuser can input from the client computer 1005 an instruction that can beselected or input using the operation section 2012 of the copyingmachine 1001.

[0134] The encoding scheme used in storing image data or the data formatfor putting a plurality of stored image data into one file are notlimited to those described in the above embodiments, and any otherscheme or format can be used.

[0135] The present invention may be applied to a system constituted by aplurality of devices (e.g., a host computer, an interface device, areader, a printer, and the like) or an apparatus comprising a singledevice (e.g., a copying machine, a facsimile apparatus, or the like).

[0136] The object of the present invention is achieved even by supplyinga storage medium (or recording medium) which records software programcodes for implementing the functions of the above-described embodimentsto the system or apparatus and causing the computer (or a CPU or MPU) ofthe system or apparatus to read out and execute the program codes storedin the storage medium. In this case, the program codes read out from thestorage medium implement the functions of the above-describedembodiments by themselves, and the storage medium which stores theprogram codes constitutes the present invention. The functions of theabove-described embodiments are also implemented not only when thereadout program codes are executed by the computer but also when theoperation system (OS) running on the computer performs part or all ofactual processing on the basis of the instructions of the program codes.

[0137] The functions of the above-described embodiments are alsoimplemented when the program codes read out from the storage medium arewritten in the memory of a function expansion board inserted into thecomputer or a function expansion unit connected to the computer, and theCPU of the function expansion board or function expansion unit performspart or all of actual processing on the basis of the instructions of theprogram codes.

[0138] When the present invention is applied to the storage medium, thestorage medium stores program codes corresponding to the above-describedflow charts.

[0139] As many apparently widely different embodiments of the presentinvention can be made without departing from the spirit and scopethereof, it is to be understood that the invention is not limited to thespecific embodiments thereof except as defined in the appended claims.

What is claimed is:
 1. A data processing apparatus comprising: inputmeans for inputting data; creation means for creating a file on thebasis of the data input by said input means, the file having a pluralityof pages, and a page attribute and end information representing an endof data for each page; transmission means for transmitting the filecreated by said creation means; and addition means for, when filecreation by said creation means is interrupted, adding end informationat interrupt time, wherein when file creation by said creation means isinterrupted, said transmission means transmits the file having the endinformation added by said addition means.
 2. The apparatus according toclaim 1 , wherein said input means inputs the data from a reader forreading an image on an original to generate data representing the image.3. The apparatus according to claim 1 , wherein said creation meanscreates a file with an M-TIFF or PDF format.
 4. The apparatus accordingto claim 1 , wherein said apparatus further comprises an operationsection for inputting a manual instruction by a user, and the interruptis executed on the basis of an interrupt instruction from said operationsection.
 5. The apparatus according to claim 1 , wherein saidtransmission means transmits the data on the basis of a file transferprotocol.
 6. A data processing method comprising: the input step ofinputting data; the creation step of creating a file on the basis of thedata input in the input step, the file having a plurality of pages, anda page attribute and end information representing an end of data foreach page; the transmission step of transmitting the file created in thecreation step; and the addition step of, when file creation in thecreation step is interrupted, adding end information at interrupt time,wherein when file creation in the creation step is interrupted, thetransmission step comprises transmitting the file having the endinformation added in the addition step.
 7. A computer-readable programstored in a computer-readable storage medium, comprising: the input stepof inputting data; the creation step of creating a file on the basis ofthe data input in the input step, the file having a plurality of pages,and a page attribute and end information representing an end of data foreach page; the transmission step of transmitting the file created in thecreation step; and the addition step of, when file creation in thecreation step is interrupted, adding end information at interrupt time,wherein when file creation in the creation step is interrupted, thetransmission step comprises transmitting the file having the endinformation added in the addition step.
 8. A data processing apparatuscomprising: input means for inputting data; creation means for creatinga file on the basis of the data input by said input means, the filehaving a plurality of pages, and a page attribute and end informationrepresenting an end of data for each page; transmission means fortransmitting the file created by said creation means; and division meansfor dividing, in units of pages, the data to be transmitted by saidtransmission means, wherein said division means adds a page attributeand end information to each of the divided data, like said creationmeans, and said transmission means sequentially transmits the datadivided by said division means.
 9. The apparatus according to claim 8 ,wherein said division means divides an amount of data when the datainput by said input means exceeds a predetermined data amount.
 10. Theapparatus according to claim 8 , wherein said input means inputs datarepresenting an image from a reader for reading an image on an original.11. The apparatus according to claim 8 , wherein said creation meanscreates a file with a PDF format.
 12. The apparatus according to claim 8, wherein said division means adds, to one of the divided data,information capable of specifying the other data.
 13. A data processingmethod comprising: the input step of inputting data; the creation stepof creating a file on the basis of the data input in the input step, thefile having a plurality of pages, and a page attribute and endinformation representing an end of data for each page; the transmissionstep of transmitting the file created in the creation step; and thedivision step of dividing, in units of pages, the data to be transmittedin the transmission step, wherein the division step comprises adding apage attribute and end information to each of the divided data, like thecreation step, and the transmission step comprises sequentiallytransmitting the data divided in the division step.
 14. Acomputer-readable program stored in a computer-readable storage medium,comprising: the input step of inputting data; the creation step ofcreating a file on the basis of the data input in the input step, thefile having a plurality of pages, and a page attribute and endinformation representing an end of data for each page; the transmissionstep of transmitting the file created in the creation step; and thedivision step of dividing, in units of pages, the data to be transmittedin the transmission step, wherein the division step comprises adding apage attribute and end information to each of the divided data, like thecreation step, and the transmission step comprises sequentiallytransmitting the data divided in the division step.
 15. A dataprocessing apparatus comprising: input means for inputting data in unitsof pages; storage means for storing a plurality of data input by saidinput means; selection means for selecting a plurality of data from theplurality of data stored by said storage means; creation means forcreating a file from the plurality of data selected by said selectionmeans, the file having a page attribute and end information representingan end of the data for each page; and transmission means fortransmitting the data created by said creation means.
 16. The apparatusaccording to claim 15 , wherein said creation means creates a fileaccording to an order selected by said selection means.
 17. Theapparatus according to claim 15 , wherein said input means inputs thedata from a reader for reading an image on an original to generate datarepresenting the image.
 18. The apparatus according to claim 15 ,wherein said creation means creates a file with an M-TIFF or PDF format.19. A data processing method comprising: the input step of inputtingdata in units of pages; the storage step of storing a plurality of datainput in the input step; the selection step of selecting a plurality ofdata from the plurality of data stored in the storage step; the creationstep of creating a file from the plurality of data selected in theselection step, the file having a page attribute and end informationrepresenting an end of the data for each page; and the transmission stepof transmitting the data created in the creation step.
 20. Acomputer-readable program stored in a computer-readable storage medium,comprising: the input step of inputting data in units of pages; thestorage step of storing a plurality of data input in the input step; theselection step of selecting a plurality of data from the plurality ofdata stored in the storage step; the creation step of creating a filefrom the plurality of data selected in the selection step, the filehaving a page attribute and end information representing an end of thedata for each page; and the transmission step of transmitting the datacreated in the creation step.